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Nonstructural protein 14 (nsp14) of coronaviruses (CoV) is important for 
viral replication and transcription. The N-terminal exoribonuclease 
(ExoN) domain plays a proofreading role for prevention of lethal 
mutagenesis, and the C-terminal domain functions as a (guanine-N7) 
methyl transferase (N7-MTase) for mRNA capping. The molecular basis 
of both these functions is unknown. Here, we describe crystal 
structures of severe acute respiratory syndrome (SARS)-CoV nsp14 in 
complex with its activator nonstructural protein10 (nsp10) and func- 
tional ligands. One molecule of nsp10 interacts with ExoN of nsp14 to 
stabilize it and stimulate its activity. Although the catalytic core of 
nsp14 ExoN is reminiscent of proofreading exonucleases, the presence 
of two zinc fingers sets it apart from homologs. Mutagenesis studies 
indicate that both these zinc fingers are essential for the function of 
nsp14. We show that a DEEDh (the five catalytic amino acids) motif 
drives nucleotide excision. The N7-MTase domain exhibits a noncanon- 
ical MTase fold with a rare f-sheet insertion and a peripheral zinc 
finger. The cap-precursor guanosine-P3-adenosine-5’,5’-triphosphate 
and S-adenosyl methionine bind in proximity in a highly constricted 
pocket between two ß-sheets to accomplish methyl transfer. Our stud- 
ies provide the first glimpses, to our knowledge, into the architecture 
of the nsp14-nsp10 complex involved in RNA viral proofreading. 
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Co (CoV), belonging to the Coronaviridae family in 
the order Nidovirales (1), are one of the major threats to public 
health. The most notable infections are the severe acute respiratory 
syndrome (SARS) and Middle East respiratory syndrome (MERS) 
caused by the SARS-CoV and MERS-CoV, respectively (2, 3). 
SARS-CoV brought about more than 8,000 infections and 800 
deaths, and MERS-CoV has caused 1,139 cases of infections and 
431 deaths till May 25, 2015 (www.who.int/en/). 

CoVs have the largest genomes among RNA viruses (4). There 
are 14 ORFs in the genome of SARS-CoV. Among these, ORFla 
and ORF1b encode 16 nonstructural proteins (nsp) that pre- 
dominantly play a role in replication and transcription (5). Within 
these nsps, nsp12 functions as a RNA-dependent RNA polymerase 
(RdRp), and nsp8 together with nsp7 functions as a primase and 
confers processivity to polymerization by nsp12 (6-8). More im- 
portantly, the nsp7-nsp8-nsp12 complex can associate with nsp14 
without impacting RNA synthesis (8). This interaction is crucial, 
because nsp14 has been shown to play a pivotal role in decreasing 
the incidence of mismatched nucleotides through its exoribonu- 
clease domain (ExoN) (9-11), a role akin to a proofreading ExoN 
associated with a polymerase. Abrogation of the nsp14 ExoN ac- 
tivity results in enhanced sensitivity to the RNA mutagen 5-fluo- 
rouracil (12, 13). The nsp14—nsp10 complex can exquisitely excise 3’ 
mismatched nucleotides from dsRNA (14). Disturbance of the in- 
teraction between nsp14 and nsp10 has been shown to result in a 
decrease in replication fidelity (15). Consequently, in contrast to the 
general replication fidelity of RNA viruses (10-10), the low 
mutation rate (10-107) of SARS-CoV is tied to the ExoN activity. 
A DEDDh (the five catalytic amino acids) motif drives catalysis by 
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nsp14 that is important for the viral replication and transcription 
(14, 16). In vivo studies using mouse models have demonstrated a 
role for this ExoN activity in viral virulence and pathogenesis (17). 
Nsp14 is highly conserved within the Coronaviridae family. In- 
triguingly, ExoNs also are encoded by RNA viruses belonging to the 
order Nidovirales with genomes larger than 20 kb (4, 18, 19). 

In addition, nsp14 is also known to function as an S-adenosyl 
methionine (SAM)-dependent (guanine-N7) methyl transferase 
(N7-MTase) (20). Assembly of a cap1 structure at the 5’ end of viral 
mRNA assists in translation and evading host defense (21-23). 
Formation of this cap in SARS-CoV requires four sequential re- 
actions. First, nsp13 RNA triphosphatase (RTPase) hydrolyzes 
nascent RNA to yield pp-RNA (24). Then an unknown guanylyl- 
transferase (GTase) hydrolyzes GTP, transfers the product GMP to 
pp-RNA, and creates Gppp-RNA. Then nsp14 methylates the 5’ 
guanine of the Gppp-RNA at the N7 position, followed by meth- 
ylation of the ribose of the first nucleotide at the 2’-O position by 
nsp16 (20, 25). Nsp10 has been shown to activate the 2’-O-MTase 
activity of nspl6 by stabilizing the SAM-binding pocket and 
extending the substrate RNA-binding groove of nsp16 (26, 27). 
Similarly, the ExoN activity of nsp14 is fully unleashed only in the 
presence of nsp10 (14). However, the molecular basis for this ac- 
tivation is poorly understood. 


Significance 


Proofreading exonucleases contributing to replication fidelity in 
DNA viruses and cellular organisms are well known; however, 
proofreading in RNA viruses was unknown until recently. Corona- 
virus nonstructural protein 14 (nsp14) has been shown to function 
as a proofreading exoribonuclease. Additionally, nsp14 shows 
(guanine-N7) methyl transferase activity for viral mRNA capping. 
Both roles are important for viral replication and transcription. Here, 
we report the structures of severe acute respiratory syndrome- 
coronavirus nsp14 in complex with its activator nonstructural pro- 
tein 10 (nsp10) and functional ligands. Structural observations 
coupled with mutagenesis and functional assays provide a better 
understanding of the function of nsp14. Furthermore, the structures 
of the nsp14-nsp10 complex demonstrate several unique niches 
that could be targeted for development of potent antiviral drugs. 
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Fig. 1. Overall structure of the nsp14—nsp10 complex. (A) Domain organization 
of nsp14 and nsp10. Domain boundaries are marked with residue numbers. 
(B) Cartoon representation of the structure of the nsp14-nsp10 heterodimer. 
Nsp10, the ExoN domain, the N7-MTase domain, and the loop at the N terminus 
of N7-MTase are marked red, green, marine, and pink, respectively. Invisible 
residues from 454 to 464 of nsp14 are shown by a dashed line. Magnesium, zinc 
ions, and the N7-MTase substrates SAH and GpppA are shown as spheres and are 
colored magenta, gray, yellow, and light pink, respectively. Three zinc fingers 
(ZF) of nsp14 are highlighted with residues shown as sticks. 


Here, we report the nsp14—nsp10 complex structures and show 
how nsp14 is activated by nsp10. Structural observations coupled 
with mutagenesis and functional assays unveil several previously 
unknown features of nsp14 and provide a better understanding of 
its participation in proofreading and mRNA capping. 


Results and Discussion 


Overall Structure of the nsp14—nsp10 Complex. Full-length nsp14 
was coexpressed with nsp10 in Escherichia coli and purified as a 
preformed complex (Fig. 14 and Fig. S1). The unliganded, SAM- 
bound, and S-adenosyl homocysteine (SAH)-guanosine-P3-adeno- 
sine-5’,5’-triphosphate (GpppA)-bound nsp14—nsp10 complex struc- 
tures were refined to 3.4 A, 3.2 A, and 3.3 A resolutions, respectively 
(Table S1). With the exception of the region encompassing amino 
acids 454464 of nsp14 and the C terminus of nsp10 (residues 132- 
139), all residues of nsp14 and nsp10 could be built in the final model 
(Fig. 1B). The complex structure reveals that one molecule of nsp10 
binds one molecule of nsp14 (Fig. 1B). Furthermore, nsp14 is 
bimodular; amino acids 1-287 fold into the ExoN domain, and amino 
acids 288-527 form the N7-MTase domain. A convoluted loop 
consists of amino acids 288-301, and a break in it could result in 
abolishment of the N7-MTase activity (28). Electron density maps of 
the two active centers are shown in Fig. S2. Nsp10 interacts exclu- 
sively with the ExoN domain of nsp14 (Fig. 1B), as is consistent with 
previous biochemical results showing that nsp10 stimulates the ExoN 
activity without perturbing the N7-MTase activity (14, 25). 


Structure of the nsp14 ExoN Domain. The ExoN domain contains a 
central, twisted B-sheet made up of five B-strands (Fig. 24). The 
strands form a parallel B-sheet with the exception of 63 and are 
flanked by a-helices on either side. Such an arrangement of the core 
structural elements is reminiscent of the structures of the DEDD 
superfamily exonucleases, such as the e subunit of E. coli DNA 
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polymerase III [Protein Data Bank (PDB) code 1J53; Z score of 
11.7, rmsd of 3.0 A over 174 aligned Ca atoms as calculated by 
DALI server (Fig. 2B)] (29). One Mg” ion is observed at its active 
center (Fig. 24). However, three major structural differences set 
nsp14 apart from typical DEDD family exonucleases (Fig. 2A). 
First, the N-terminal Alal-Arg76 forms a long, flexible region that 
interacts with nsp10. Second, a B-hairpin structure containing B5 
and ß6 (Ala119-Asp145) also is observed to interact with nsp10. 
Last, the most striking difference is the presence of a zinc finger on 
either side of the B-sheet. The first zinc finger, comprising of 
Cys207, Cys210, Cys226, and His229, is located between a4 and 
610, and the second zinc finger comprising, His257, Cys261, His264, 
and Cys279, is located between a5 and a6 (Figs. 1B and 2A). 


Comparison of nsp14 ExoN Active Sites with Proofreading Homologs. 
Although the overall structure of the nsp14 ExoN domain has di- 
verged significantly from other proteins, the architecture of the 
catalytic core and active sites resembles those used by DEDD-type 
exonucleases, such as the proofreading ExoN domain of DNA 
polymerase I Klenow fragment (PDB ID code 1KLN) and the ¢ 
subunit of DNA polymerase III (PDB ID code 1J53) of E. coli, 
suggesting a conserved mechanism for catalysis. These proteins and 
nsp14 share a similar two-metal-ion—assisted mechanism for re- 
moval of misincorporated nucleotides (30-32). In the nsp14—nsp10 
complex structure, only one Mg”* coordinated by Asp90 and 
Glu191 is observed. The absence of the second metal ion could be 
attributed to the lack of the substrate or product binding (30). In- 
gredients such as the 3’ end of DNA, metal ions, and the solvent 
molecule required for catalysis are positioned in place by side chains 
of D355, E357, D424, Y497, and D501 of the Klenow fragment 
(Fig. 2C) (33). The ExoN domain of nsp14 exhibits a similar con- 
stellation of acidic amino acids, with one notable difference. A 
conserved aspartate of motif II (34, 35), for instance Asp424 of the 
Klenow fragment and Asp103 of the £ subunit of polymerase III, is 
replaced by Glu191 in nsp14 (Fig. 2C and Fig. S3) (31, 33). In 
addition, His268 categorizes nsp14 as a DEDDh-type exoribonu- 
clease. The position of these catalytic amino acids around a nu- 
cleotide modeled in the active site of ExoN is shown in Fig. 2C. The 
Mg* activates one molecule of water to initiate the nucleophilic 
attack on the phosphorous of the substrate, whereas Mg” facilitates 
the leaving of the product (Fig. 2C). Simultaneously mutating 
Asp90 and Glu92 to alanine impaired the ExoN activity signifi- 
cantly, whereas E191A, H268A, or D273A mutants were severely 
deficient in their ability to degrade RNA, confirming the impor- 
tance of these amino acids in the excision of nucleotides (Fig. 2D). 
Intriguingly, Asp243, previously mistaken as a catalytic residue (5, 
16), is the fifth highly conserved acidic amino acid located between 
motifs II and II (Fig. 2C and Fig. S3) (34). The ExoN activity of 
D243A mutant is completely lost (Fig. 2D). In contrast to reported 
results, the activity of DODA/E92A, D243A, H268A, and D273A 
complexed with nsp10 or not on different substrates has been shown 
to be nearly identical (14, 16, 20). 

Further, we mutated residues of the two zinc fingers of the ExoN 
domain of nsp14 to decipher their functions. Zinc finger 2 is in 
proximity to the catalytic residues. Disruption of this zinc finger via a 
C261A or H264R mutation abolished the enzymatic activity, sug- 
gesting this zinc finger has a role in catalysis (Fig. 2D). In stark 
contrast to mutants of zinc finger 2, none of the mutants of zinc finger 
1 could be expressed as soluble proteins. Inspection of the structure 
reveals that zinc finger 1 contributes the structural stability of nsp14. 

Thus, Asp90, Glu92, Glu191, His268, and Asp273 of nsp14 are 
likely to provide the electrophilic environment necessary for li- 
gand binding and catalysis. In addition, both the zinc fingers of 
the ExoN domain are essential for the function of nsp14. 


Stimulation of ExoN Activity of nsp14 by nsp10. In the absence of 


nsp10, nsp14 cannot catalyze nucleotide excision efficiently (Fig. 2D). 
The structure of the nsp14—nsp10 complex explains this requirement 
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Fig. 2. Comparison of the structure and catalytic residues of nsp14 ExoN domain with proofreading homologs. (A) Cartoon representation of the ExoN domain marked 
with secondary structural elements. The three different regions from other DEDD superfamily exonucleases are indicated by red dashed ellipses. (B) The structure of the 
E. coli e subunit of polymerase III (Pol III) is shown in the same orientation as nsp14 for comparison. Metal ions are shown as spheres, and bound ligands are shown as sticks. 
(C) The active center of the ExoN domain of nsp14. (Upper) Catalytic residues of the ExoN domain of nsp14, the exonuclease domain of DNA polymerase |, and the € 
subunit of DNA polymerase Ill of E. coli are listed in the table. (Lower) Catalytic residues, the modeled substrate AMP, and the mistaken D243 are shown as sticks. Mg® 
observed in the structure and Mg“ modeled are shown as spheres. Dashed lines indicate the hydrogen bonds between Mg? and D90 and E191. (D) Exoribonuclease assays 
for nsp10, nsp14 alone, and nsp14 or nsp14 mutants in complex with nsp10 on 5’-labeled ssRNA of 22 nucleosides (RNA22). The symbol “#" indicates cleavage products. 


of nsp10 for enzymatic activity of nsp14. Known structures of 
nsp10 could be superimposed over the nsp14—nsp10 complex 
with an rmsd of <0.8 A between matching Ca atoms over the 
entire length of nsp10. Two regions of nsp10 contribute the 
major residues for nsp14—nsp10 interaction (Fig. 34). The first 
contact area involves the entire N-terminal loop and helix al 
(Prol—Leu24) of nsp10, which has led to interpretable electron 
density for these first nine residues of nsp10 that were not ob- 
served in previous structures (26, 27, 36, 37). The residues Alal, 
Asn3, and Glu6 of nsp10 stabilize the N terminus of nsp14 
by forming hydrogen bonds with Lys9, Asp10, and Thr5, re- 
spectively, whereas Phel6, Phe19, and Val21 of nsp10 form van 
der Waals interactions with Phe60, Met62, and Tyr64 of nsp14 
(Fig. 3 B and D). The second region of intermolecular interactions is 
extensive and encompasses residues from the loop region following 


V21 L45 
F19  K43 
A1 N3 E6 F16 N40 | 


i A23 i N129 } 
D41 DT26 Y51 
nsp14 


Fig. 3. 


helix «2 and residues around zinc finger 1. Here, a number of com- 
plementary hydrogen bonds are observed; Asn40, Lys43, Leu45, 
Thr58, Ser72, Lys93, and Tyr96 of nsp10 interact with Thr25, His26, 
Cys39, Asp41, Ala23, Tyr51, and His19 of the N terminus of nsp14. A 
salt bridge formed between His80 of nsp10 and Asp126 of nsp14 and 
a hydrogen bond between Cys90 of nsp10 and Asn129 of nsp14 
stabilize the structural elements between ß5 and p6 of nsp14 (Fig. 3 C 
and D). The extensive interaction of nsp10 with nsp14 suggests that 
nsp10 might be necessary to maintain the structural stability of the 
ExoN domain and fully unleash the ExoN activity of nsp14 (14). 
Previous mutagenesis studies had shown that regions of nsp10 
contacting nsp14 and nspl6 overlap substantially (38, 39). 
Comparison of the nsp14—nsp10 complex with the nsp16-nsp10 
complex surprisingly reveals that a significantly larger surface area 
of nsp10 contacts with nsp14 (Fig. S4). The buried solvent- 


F16 


Intermolecular interactions between nsp14 and nsp10. Zinc ions are represented as gray spheres in A-C. (A) The Nsp14 ExoN domain (green) is 


stabilized by nsp10 (red). Two regions of nsp10 (boxed) contribute major interactions with nsp14. 2n1"P'° the first zinc ion of nsp10. (B and C) Interaction 
details by regions 1 (B) and 2 (C). Hydrogen bonds between residues are shown by dashed lines. Residues of nsp14 and nsp10 involved in interaction are 
displayed as green and red sticks, respectively. (D) Schematic representation of the contacts between nsp10 and nsp14. 
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Fig. 4. Structure and methyl transfer mechanism of the nsp14 N7-MTase 


domain. (A) Cartoon representation of the N7-MTase domain marked with 
secondary structural elements. (B) Amino acids within 4 Å of the ligands SAH 
(blue) and GpppA (red) are labeled and numbered. The magenta arrow in- 
dicates the methyl transfer. Dashed lines between residues indicate hydrogen 
bonds. Trp385, Asn386, Phe401, and Phe506 are shown by dashed bonds to 
depict their position below the plane of ligand GpppA. (C) The ability of nsp14 
to methylate N7 of guanine of GpppA-RNA was measured. The results depict 
the efficiency of the conversion of substrate to product (%) and are plotted as 
a bar graph. WT nsp14 and its mutants were complexed with nsp10. 


accessible areas of nsp10 for contacting nsp14 and nsp16 are 2,236 
Å? and 938 A?, respectively. Because nsp10 is encoded in about 
three- to sixfold excess over nsp14 and nsp16, the nsp14—nsp10 
complex and nsp16—nsp10 complex can exist simultaneously. 


Structure of the nsp14 N7-MTase Domain. The N7-MTase domain of 
nsp14 contains an atypical MTase fold (Fig. 44). The central B-sheet 
is made up of five B-strands instead of seven; the canonical 63 and B6 
stands are missing. Among these strands, 62’, B1’, 63’, and B4’ are 
positioned parallel, whereas $8’ runs antiparallel. Contrarily, a 
seldom-observed insertion of a three-stranded antiparallel B-sheet 
between canonical strands B5 and {6 of the central sheet is observed 
in nsp14 (Fig. 44). The presence of such a sheet has been noted in 
the structure of the N7-MTase from Vaccinia virus (PDB ID code 
2VDW). This small 62-sheet is positioned almost perpendicular to 
the central B1-sheet. A cavity between sheets B1 and 62 functions as 
a ligand-binding pocket. Two small helices are embedded in the 
connecting loops of the B1-sheet, and a lone a-helix, al’, is stacked 
against the opposite face of the central B1-sheet. Behind this a-helix 
lies another long a-helix, «2’. The third zinc finger of nsp14 formed 
by Cys452, Cys477, Cys484, and His487 is located at the tip of this 
helix and protrudes from the protein at its C terminal (Figs. 1B and 
4A). Intriguingly, an additional a-helix, «3’ spanning the last 12 
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amino acids (Thr516-Gly527) is observed stabilizing the local hy- 
drophobic environment (Fig. S5). Such a modification at the C ter- 
minus has been observed previously in the nucleic acid-binding SAM- 
dependent MTases (40). Truncation of this region has been shown to 
attenuate greatly or abolish the N7-MTase activity of nsp14 (20). 


Structural Basis for Methylation of N7 of Guanine by nsp14. In 
comparing the structures of unliganded, SAM-bound, and SAH- 
GpppA-bound nsp14—nsp10 complexes, no significant structure 
movement is observed, suggesting that the ligand-binding sites 
are preformed. The ligands bind in a pocket surrounded by 
sheets B1 and £2 and helix a1’ (Fig. 44 and Fig. S6). 

The methyl donor SAM sits above the central B1 sheet in the 
space between the loops connecting strands 62’ with 63’ and £3’ 
with 64’. The purine ring is bound in a hydrophobic environment 
created by amino acids such as Ile332, Phe367, and Val389 (Fig. 
4B). The O81 and O82 of Asp352 form hydrogen bonds with the 
03’ and O2’ atoms of the ribose, respectively (Fig. 4B). Substitution 
of Asp352 with alanine reduces the N7-MTase activity of nsp14 by 
20% (Fig. 4C). Asp331 and Gly333, at the end of $1’, which pre- 
viously were shown to be essential for the N7-MTase activity of 
nsp14 (20), are observed in proximity to the carboxyl group of 
SAM. The carbonyl oxygen of Gly333 forms a hydrogen bond with 
the amino nitrogen of methionine of SAM. Not surprisingly, a 
D331A/G333A double mutation completely abolished the N7- 
MTase activity (Fig. 4C). The Trp292 at the N-terminal loop of N7- 
MTase (Lys288—Asp301) stacks against the ribose for optimal po- 
sitioning of SAM for catalysis. Disruption of the interaction of 
Trp292 with ribose reduced the N7-MTase activity by more than 
50% (Fig. 4 B and C). These results are consistent with those 
reported for the methyltransferase from the Encephalitozoon 
cuniculi (Ecm1), in which mutations disrupting the interaction of 
the N7-MTase with ribose and methionine moieties of SAM im- 
pair activity greatly or completely, but those affecting interactions 
with adenine are not important for activity (41). 

Methyl receptor GpppA binds near SAM. Side chains of Phe401, 
Tyr420, Phe426, Thr428, and Phe506 entrench and hold the purine 
moiety of guanosine in position (Fig. 4B). Among these amino acids, 
Phe426 showed the largest influence on the N7-MTase activity, and 
F426A mutation reduced MTase activity by 50% (Fig. 4C). Asn386 is 
located in immediate proximity to the atoms involved in methyl 
transfer and forms two hydrogen bonds with the guanine moiety to 
help orient it during catalysis. An N386A mutant abolished more 
than 50% of the N7-MTase activity (Fig. 4C). The O8 atom of 
Asn422 is observed to form hydrogen bonds with the O3’ atom of the 
guanosine ribose moiety. Mutating it to alanine does not significantly 
impact the N7-MTase activity. Asn306, Arg310, and Lys336 con- 
tribute the positive potential for binding the triphosphate moiety. In 
particular, N52 of Asn306 as well as Nn1 and Nn2 of Arg310 interact 
with the second phosphate group (Fig. 4B). The importance of these 
side chains in catalysis is underscored by the fact that the R310A 
mutant retained only 20% of the activity, whereas the K336A mutant 
retained 50% (Fig. 4C). Last, Trp385 stacks against the adenosine 
moiety of GpppA, and a W385A mutation reduced the activity by 
more than one third (Fig. 4 B and C). 

The mutations R310A, P335A, K336A, D352A, and Y420A have 
been reported; the changes in the activity of these mutants are 
consistent with our results, except that Y420A was reported to be 
inactive (28), but we found that the L419A/Y420A double mutant 
retained 70% activity. We believe our results to be reasonable, 
based on the location of these two residues. 

Thus, the ligands are held in a highly constricted pocket, en- 
suring that the methyl donor SAM and acceptor GpppA are held 
in close proximity. By enforcing charge and shape complemen- 
tarity, the ligands are oriented in the pocket so that the methyl 
group of SAM is brought in the vicinity of the N7 of guanine to 
realize the methyl transfer using an in-line mechanism, as pre- 
viously proposed for Ecm1 (41). 
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Intriguingly, zinc finger 3 of nsp14 located in the N7-MTase 
domain is spatially isolated from the active site, ruling out a direct 
role for this motif in catalysis (Fig. 44). Consistent with this ob- 
servation, C452A and H487R mutations of zinc finger 3 had only a 
marginal effect on the MTase activity (Fig. 4C). The peripheral 
location of zinc finger 3 may be more suited to forging protein- 
protein interactions, for example binding with the nsp16-nsp10 
complex to accomplish the second methyl transfer reaction essential 
for formation of cap1 structure. 

Raw TLC data of the N7-MTase activity assays are included as 
Fig. S7. 


The Interactions Between the Two Naturally Fused Domains. The 
ExoN and MTase domains of nsp14 are interlinked, and they in- 
teract using hydrophobic interactions (Fig. S8). Three a-helices of 
EXoN stabilize the N-terminal loop (Lys288—Asp301) and the 
B2-sheet of N7-MTase domain, the base of the substrate binding 
pocket of N7-MTase. Ile80 and Val83 of «1, Leul77 of «3, and 
Val282 and Phe286 of «6 form hydrophobic interactions with 
Val294, Tyr296, Pro297, Ile299, Leu411, Pro412, and Leu419. All 
these amino acids are highly conserved within members of the 
coronavirus genus (Fig. S3), highlighting an important conserved 
role for these residues (28). 


Model for the Role of nsp14 in Proofreading and mRNA Capping. To 
envision how nsp14 could participate in the functioning of the 
replication and transcription complex (RTC), we attempted to infer 
the interaction mode between nsp14 and nsp12, the RdRp. The 
polymerization and proofreading sites of DNA polymerases are 
located on different domains or subunits that are typically separated 
by a distance of 30 A or more (42). In contrast, during transcription, 
RNA polymerases carry out both these functions using the same 
active sites (43). In this context, nsp14 is functionally and structur- 
ally more similar to the proofreading exonuclease domain of DNA 
polymerases. Using the known structure of the Klenow fragment of 
DNA polymerase I (PDB ID code 1KLN) as a guide, the ExoN 
domain of nsp14 was placed next to the polymerization site (Fig. 5). 
During instances of mismatch of nucleotides, the nascent strand is 
moved to the ExoN site for excision. As the newly synthesized 
strand is extended, a cap0 structure is first assembled at the 5’ end. 
This assembly would require the enzymatic activities of a RTPase 
contributed by nsp13 (24), a guanylyl transferase, which is yet to be 
identified, and the N7-MTase activity of nsp14. Last, the 2’-O- 
MTase activity of nsp16 would result in the formation of a cap1 
structure at the 5’ end of the newly synthesized RNA. Such a model 
accounts for nsp13 functioning as a helicase for unwinding double- 
stranded nucleic acids and the direct nsp13-nsp12 interactions (44, 
45). Further experiments are necessary to shed more light on the 
relative orientation of the various active sites of the RTC and how 
the RNA is maneuvered for formation of the cap1 structure. 


Conclusions 


The structures of SARS-CoV nsp14 in complex with nsp10 de- 
scribed here reveal that the core structural elements and catalytic 
residues of the proofreading ExoN domain of nsp14 are highly 
similar to those of its eukaryotic and prokaryotic predecessors. 
Furthermore, nsp14 of SARS-CoV has gained some striking and 
remarkable structural features to accomplish additional tasks such 
as the methylation of N7 of guanine of cap precursors. Our studies 
unveil several of these nsp14-specific features. First, nsp10 contacts 
the nsp14 ExoN domain to provide structural support and facilitate 
its ExoN activity for proofreading. Second, two zinc fingers are 
located on the ExoN domain. Our mutagenesis studies show that 
both zinc fingers are essential for the nsp14 function. Third, the N7- 
MTase domain of nsp14 adopts an atypical MTase fold and ropes in 
a second -sheet to bind the substrates in a highly constricted cavity 
to accomplish methyl transfer. This N7-MTase domain carries 
a third zinc finger that is located remote from the active sites. 
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Fig. 5. Model for the role of nsp14 in proofreading and mRNA capping. Nascent 
RNA is synthesized at the polymerization sites (red circle) of the nsp12 RdRp do- 
main (gray) or is mismatch excised at the proofreading site (red circles) of nsp14 
ExoN (green surface). The relative orientation of the polymerization and proof- 
reading domains is built based on the Klenow fragment. The 5’ end of newly 
synthesized mRNA is modified by sequential activities contributed by RTPase of 
nsp13, GTase (currently unknown), N7-MTase of nsp14, and 2’-O-ViTase of nsp16 
(light pink surface, PDB ID code 3R24) for formation of a cap1 structure. 


Combined with previous functional data, this work provides evidence 
for the likely mechanism underlying the involvement of nsp14 in viral 
replication and transcription. 


Materials and Methods 

Protein Production. Full-length nsp10 and nsp14 of SARS-CoV isolate Tor2/FP1- 
10851 were synthesized by GENEWIZ Inc. and cloned into pGEX-6p-1 and 
pRSFDuet-1, respectively. Nsp10 with an N-terminal GST tag and nsp14 with a 
C-terminal His tag were coexpressed in E. coli BL21 (DE3). Cells were grown at 
37 °C and induced with 0.1 mM isopropyl B-p-thiogalactoside when ODeoonm 
reached 0.8. After induction at 16 °C for 18 h, cells were harvested, resuspended, 
and lysed by sonication in 20 mM Hepes (pH 7.0), 150 mM NaCl, 4 mM MgCl, and 
5% (volWol) glycerol. After centrifugation, recombinant protein was purified by 
GST-affinity chromatography. The GST tag of nsp10 was removed by PreScission 
protease. Excess nsp10 was separated from the nsp14-nsp10 complex by HiTrap 
S ion-exchange chromatography (GE Healthcare). 

Mutagenesis was performed using the Easy Mutagenesis System (Transgen 
Biotech). All nsp14 mutants were coexpressed with nsp10 and purified as 
described above. 

Nsp14 alone was purified using Ni-nitrilotriacetate affinity resin (GE 
Healthcare) with a gradient of imidazole for washing and elution. 


Crystallization. Crystallization was performed by the hanging-drop vapor- 
diffusion method at 16 °C. Each crystallization drop consisted of 1 uL of 
nsp14—nsp10 complex (10 mg/mL) mixed with 10 mM DTT and 0.25 mM SAM 
and 1 uL of the mother liquor equilibrated over 200 uL of reservoir solution. 
Diffraction-quality crystals grew in 15% (vol/vol) Tacsimate (pH 7.0), 0.1 M 
Hepes (pH 7.0), and 2% (wt/vol) polyethylene glycol 3,350 after optimization 
by microseeding. Diffraction was further improved by dehydration in 25% 
glycerol for 3 min. To obtain a complex of nsp14—nsp10 with SAM and 
GpppA, crystals were soaked with 50 mM GpppA (New England Biolabs). 


Data Collection and Structure Determination. X-ray diffraction data were col- 
lected at Shanghai Synchrotron Radiation Facility beamlines BL19U and 17B. All 
data were processed with the program HKL3000 (46). The initial phases were 
calculated by the single-wavelength anomalous dispersion method using the 
anomalous signal of zinc atoms with the program PHENIX.autosol (47). A total 
of 10 zinc atoms were found using HYSS (48). After phasing, twofold non- 
crystallographic symmetry was identified and used in subsequent density mod- 
ification, dramatically improving the phases of diffraction data. Two molecules 
of nsp10 (PDB ID code 2G9T) were docked into the map using MOLREP (49). 
Other parts of the complex were manually built in COOT (50) and iteratively 
refined in PHENIX.refine (51). To solve the structure of the unliganded nsp14— 
nsp10 complex or nsp14—nsp10-SAH-GpppA complex, molecular replacement 
was done first in MOLREP using the structure of the nsp14—nsp10-SAM complex 
as the search model, and then the structures were refined in PHENIX.refine. Data 
collection and refinement statistics are listed in Table $1. 


Ma et al. 


Pwo NAS PNAS 


Exoribonuclease Activity Assay. ssRNA made up of 22 nucleotides (RNA22, 
5’-GGGCGAUUAGGAGCUAACUGCG-3’) was used as a substrate for activity as- 
says (16). To obtain 5’-labeled RNA22, it was incubated with T4 polynucleotide 
kinase (New England Biolabs) and y-P?P]-ATP (PerkinElmer). MicroSpin G-25 
columns (GE Healthcare) were used to remove excess y-P2P]-ATP. Later, RNA22 
was extracted with phenol-chloroform and precipitated with ethanol. 

Reaction mixtures contained 300 nM nsp14-nsp10 complex (or nsp14 mutant- 
nsp10 complex or nsp14 alone), 1,000-cpm labeled RNA22, and 300 nM unlabeled 
RNA22 in a buffer made up of 50 mM Hepes (pH 7.0), 50 mM NaCl, 5 mM MgCl, 
and 1 mM DTT. After incubation at 37 °C for 5 or 30 min, the reactions were 
stopped by the addition of an equal volume of loading buffer (96% formamide 
with 10 mM EDTA). Products were separated on 20% 7-M urea-containing 
polyacrylamide gels and visualized through PhosphorImager. 


MTase Activity Assay. DNA fragments including the optimized T7 class II pro- 
moter 2.5 with ATP as initial nucleotide (52) and the 5’-terminal 259 nucleo- 
tides of the SARS-CoV genome (20) were used as a template for in vitro 
transcription. We then used the Vaccinia capping system (except for SAM) (New 
England Biolabs) and «-[°2P]-GTP to label the 5’ terminus of the RNA to G*pppA- 
RNA. To improve the efficiency of the reaction, 0.05 U inorganic pyrophosphatase 
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(New England Biolabs) was added. G-50 Sephadex columns (Roche) were used to 
remove unused a-[?2P]-GTP. RNA was extracted with phenol-chloroform and 
precipitated with ethanol. 

MTase activity was tested as follows: 0.1 ug nsp14—nsp10 complex or nsp14 
mutant-nsp10 complex was mixed with 1,000-com labeled RNA in a buffer 
made up of 50 mM Hepes (pH 7.0), 6 mM KCI, 5 mM DTT, 1 mM MgCl, and 
0.2 mM SAM. After incubation at 37 °C for 6 min, 5 pg nuclease P1 (Sigma) and 
1 mM ZnCl, were added to digest the RNA. Reaction products were spotted on 
polyethylenimine cellulose plates (Merck) to separate G*pppA from capped 
m7G*pppA and were visualized using a PhosphorImager. 

The marker m7G*pppA was prepared as above, except that inorganic 
pyrophosphatase was replaced by 0.2 mM SAM. 
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